Fixed UDF jar metadata handling in `UDFInfo` when multiple UDFs share the same jar by Caideyipi · Pull Request #17732 · apache/iotdb

Caideyipi · 2026-05-21T02:54:24Z

Description

This PR fixes UDF jar metadata handling in UDFInfo when multiple UDFs share the same jar.

Background

UDFInfo previously tracked uploaded jars only with jarName -> md5. When a UDF was dropped, the jar metadata was
removed immediately. This breaks the case where multiple UDFs reference the same jar:

dropping one UDF can make ConfigNode think the shared jar no longer exists
later validation may no longer detect conflicting MD5 values for the same jar name
after snapshot load, jar metadata is restored but shared-jar reference state is not rebuilt, so the same issue can
reappear after restart

Changes

This PR introduces reference counting for shared UDF jars in UDFInfo.

add existedJarToReferenceCount to track how many UDFs are using each jar
update UDF creation flow to increase jar reference count instead of only recording MD5
update UDF drop flow to remove jar metadata only when the last reference is removed
rebuild jar metadata and reference counts from the UDF table after loading a snapshot

This keeps shared jar metadata consistent across normal create/drop operations and snapshot recovery.

Tests

Updated UDFInfoTest to cover the shared-jar cases:

dropping one UDF does not remove jar metadata if another UDF still references the same jar
validation still rejects the same jar name with a different MD5 after one reference is dropped
snapshot load rebuilds shared-jar metadata correctly, and subsequent drop behavior remains correct

This PR has:

Key changed/added classes (or packages if there are too many classes) in this PR

codecov · 2026-05-21T04:13:17Z

Codecov Report

❌ Patch coverage is 98.91304% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 40.78%. Comparing base (7563ac8) to head (678ffa4).
⚠️ Report is 51 commits behind head on master.

Files with missing lines	Patch %	Lines
...mons/executable/ReferenceCountedJarMetaKeeper.java	98.38%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##             master   #17732      +/-   ##
============================================
+ Coverage     40.55%   40.78%   +0.23%     
- Complexity     2574     2610      +36     
============================================
  Files          5179     5187       +8     
  Lines        349896   351466    +1570     
  Branches      44727    44999     +272     
============================================
+ Hits         141890   143339    +1449     
- Misses       208006   208127     +121

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

luoluoyuyu

Review summary

Reference counting for shared JARs fixes incorrect removal of existedJarToMD5 when only one of several UDFs is dropped. rebuildJarMetadataFromUDFTable() keeps jar metadata consistent after snapshot load. New unit tests cover the shared-jar and snapshot paths.

Two small suggestions below; otherwise this looks good to merge.

luoluoyuyu · 2026-05-27T08:02:33Z

  }
+
+  private void addJarReference(String jarName, String jarMD5) {
+    existedJarToMD5.putIfAbsent(jarName, jarMD5);


putIfAbsent keeps the first MD5 for a jar name but always increments the reference count. That is fine when validate() runs before addUDFInTable, but if addJarReference is ever called without that check, ref count and MD5 could diverge.

Consider rejecting a conflicting MD5 inside addJarReference (same check as in validate() at lines 107-115) so this helper is safe on its own.

luoluoyuyu · 2026-05-27T08:02:33Z

      deserializeExistedJarToMD5(fileInputStream);

      udfTable.deserializeUDFTable(fileInputStream);
+      rebuildJarMetadataFromUDFTable();


After deserializeExistedJarToMD5, rebuildJarMetadataFromUDFTable() clears both maps and rebuilds from udfTable. That makes udfTable the source of truth on load.

A short comment here explaining that deserialized existedJarToMD5 is intentionally discarded would help future readers.

jt2594838

May unify the jar management of UDF and PIPE (and even more).

jt2594838 · 2026-06-01T01:44:55Z

+  public synchronized void serializeJarNameToMd5AndReferenceCount(final OutputStream outputStream)
+      throws IOException {
+    ReadWriteIOUtils.write(jarNameToMd5Map.size(), outputStream);
+    for (final Map.Entry<String, String> entry : jarNameToMd5Map.entrySet()) {
+      final String jarName = entry.getKey();
+      ReadWriteIOUtils.write(jarName, outputStream);
+      ReadWriteIOUtils.write(entry.getValue(), outputStream);
+      ReadWriteIOUtils.write(jarNameToReferenceCountMap.getOrDefault(jarName, 0), outputStream);
+    }
+  }


Is it possible or necessary to serialize an entry whose count is zero?

Good point. A zero reference count should not be serialized: under the normal add/remove path, removeReference removes both maps when the count reaches zero. I updated the snapshot logic to serialize only entries with a positive reference count and matching md5 metadata, and to skip non-positive counts when loading snapshots as a defensive cleanup. Added a regression test for the zero-count snapshot case as well.

sonarqubecloud · 2026-06-03T03:38:08Z

Quality Gate passed

Issues
2 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

… the same jar (#17732) (#17835) * UDF Fix * sp * fix * Filter invalid jar reference counts in snapshots

Caideyipi added 2 commits May 21, 2026 10:51

UDF Fix

e822a94

sp

2c62c39

This comment was marked as outdated.

Sign in to view

luoluoyuyu reviewed May 27, 2026

View reviewed changes

jt2594838 approved these changes May 29, 2026

View reviewed changes

fix

e340b75

jt2594838 reviewed Jun 1, 2026

View reviewed changes

Filter invalid jar reference counts in snapshots

678ffa4

jt2594838 approved these changes Jun 3, 2026

View reviewed changes

jt2594838 merged commit 709145c into master Jun 3, 2026
47 of 50 checks passed

jt2594838 deleted the udf-fix branch June 3, 2026 08:53

Caideyipi mentioned this pull request Jun 3, 2026

[dev/1.3] Fixed UDF jar metadata handling when multiple UDFs share the same jar #17835

Merged

jt2594838 pushed a commit that referenced this pull request Jun 4, 2026

Fixed UDF jar metadata handling in UDFInfo when multiple UDFs share…

4052320

… the same jar (#17732) (#17835) * UDF Fix * sp * fix * Filter invalid jar reference counts in snapshots

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed UDF jar metadata handling in `UDFInfo` when multiple UDFs share the same jar#17732

Fixed UDF jar metadata handling in `UDFInfo` when multiple UDFs share the same jar#17732
jt2594838 merged 4 commits into
masterfrom
udf-fix

Caideyipi commented May 21, 2026

Uh oh!

codecov Bot commented May 21, 2026 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

luoluoyuyu left a comment

Uh oh!

luoluoyuyu May 27, 2026

Uh oh!

luoluoyuyu May 27, 2026

Uh oh!

jt2594838 left a comment

Uh oh!

jt2594838 Jun 1, 2026

Uh oh!

Caideyipi Jun 3, 2026

Uh oh!

sonarqubecloud Bot commented Jun 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Caideyipi commented May 21, 2026

Description

Background

Changes

Tests

Key changed/added classes (or packages if there are too many classes) in this PR

Uh oh!

codecov Bot commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

This comment was marked as outdated.

Uh oh!

luoluoyuyu left a comment

Choose a reason for hiding this comment

Review summary

Uh oh!

luoluoyuyu May 27, 2026

Choose a reason for hiding this comment

Uh oh!

luoluoyuyu May 27, 2026

Choose a reason for hiding this comment

Uh oh!

jt2594838 left a comment

Choose a reason for hiding this comment

Uh oh!

jt2594838 Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

Caideyipi Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud Bot commented Jun 3, 2026

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov Bot commented May 21, 2026 •

edited

Loading